Detection of Compound Nouns and Light Verb Constructions using IndoWordNet
Authors
Abstract
Detection of Multiword Expressions (MWEs) is one of the fundamental problems in Natural Language Processing. In this paper, we focus on two categories of MWEs: Compound Nouns and Light Verb Constructions. These two categories can be tackled using knowledge bases rather than pure statistics. We investigate the usability of IndoWordNet for the detection of MWEs. Our IndoWordNet-based approach uses semantic and ontological features of words that can be extracted from IndoWordNet. The approach has been tested on seven Indian languages: Assamese, Bengali, Hindi, Konkani, Marathi, Odia and Punjabi. Results show that ontological features are very useful for the detection of light verb constructions, while the use of semantic properties for the detection of compound nouns is satisfactory. The approach can be easily adapted to other Indian languages. Detected MWEs can be interpolated into WordNets, as they help in representing semantic knowledge.
Similar resources
Multiword Expressions Dataset for Indian Languages
Multiword Expressions (MWEs) are used frequently in natural languages, but understanding the diversity of MWEs is one of the open problems in Natural Language Processing. In the context of Indian languages, MWEs play an important role. In this paper, we present an MWE annotation dataset created for two Indian languages, Hindi and Marathi. We extract possible MWE candidates using two r...
Detecting Noun Compounds and Light Verb Constructions: a Contrastive Study
In this paper, we describe our methods to detect noun compounds and light verb constructions in running texts. For noun compounds, dictionary-based methods and POS tagging seem to contribute most to the performance of the system, whereas for light verb constructions, the combination of POS tagging, syntactic information and restrictions on the nominal and verbal component yields the best result. Ho...
An Analysis of Persian Compound Nouns as Constructions
In Construction Morphology (CM), a compound is treated as a construction at the word level with a systematic correlation between its form and meaning, in the sense that any change in the form is accompanied by a change in the meaning. Compound words are coined by compounding templates which are called abstract schemas in CM. These abstract constructional schemas generalize over sets of existing...
Deverbal Nouns in Czech Light Verb Constructions
In this paper, we provide a well-founded description of Czech deverbal nouns in both nominal and verbal structures (light verb constructions), based on a complex interaction between the lexicon and the grammar. We show that light verb constructions result from a regular syntactic operation. We introduce two interlinked valency lexicons, NomVallex and VALLEX, demonstrating how to minimize the s...
Two Types of Korean Light Verb Constructions in a Typed Feature Structure Grammar
In this paper, I present a lexical representation of the light verb ha 'do' used in two types of Korean light verb constructions (LVCs). These two types of constructions pose the theoretical and implementation problems typical of multiword expressions (MWEs): lexical proliferation of the possible light verb senses in the lexicon, potential overgeneration of ill-formed LVCs, and the semantic ...